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PREFACE 



The Wisconsin R & D Center is designed to contribute to an understanding 
of, and the improvement of educational practices related to, cognitive learning 
by children and youth. Of primary concern in attaining the goal of the Center 
are the learning of concepts and the mirturing of related cognitive skills. Con- 
ditions within the learner, such as motivation or cognitive organization, and 
conditions within the learning situation, such as the content and sequence of 
instruction, are also relevant areas of research. 

In this technical report is described the successful extension of learning 
principles to complex human behavior, namely reading behaviors, of highly 
retarded readers. The instructional program and procedures developed by 
Staats were administered to junior high school students by previously untrained 
personnel. Implications for the treatment of behavior problems in clinical 
psychology as well as in education are drawn. 



Herbert J. Klausmeier 
Co-Director for Research 
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ABSTRACT 



A method of treating reading deficits based upon an extrinsic motivational 
system previously employed successfully with a single subject was extended to 
18 additional subjects. The junior high age subjects included retarded children 
in special classes, several emotionally disturbed children, and culturally de- 
prived children. The method of training used in the original study was designed 
to be simple to administer and simple to record the performance of the child. 
Thus, r.t was hypothesized that subprofessional personnel could be employed to 
administer the treatment. Adult volunteers and high school seniors were used 
as the instructional-technicians. 

The 18 subjects were given 38.2 hours of training, in daily half-hour ses- 
sions, during which period the average reinforcement earned was $22. 29. The 
meannumberof single word reading responses was 94,425. Tharateof reading 
accelerated over the period of training even though the reading material be- 
came more difficult. This occurred during a period when progressively less 
(about one-fourth as much) reinforcement was given per reading response. A 
mean of 616.8 new words were learned and 70.9% of these were retained in a 
long term test. The attention, attendance, cooperation, and diligent work be- 
havior of the various children were maintained in good strength throughout the 
duration of the study. 
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abstract This report describes a method of treating reading deficits based upon an 
ixtrinsic motivational system. The Ss were 18 junior high students including 
retarded children in special classes, several emotionally disturbed children, and 
culturally deprived children. The method of training was designed to be simple 
snough that subprofessional personnel could be employed to administer the treatment. 
Ldult volunteers and high school seniors were used as the instructional-technicians, 
[s were given 38.2 hours of training, in daily half-hour sessions, during which 
>eriod the average reinforcement earned was $22.29. The mean number of single 
rord reading responses was 94,425. The rate of reading accelerated over the period 
>f training even though the reading material became more difficult. This occurred 
luring a period when progressively less (about one -fourth as much) reinforcement 
ras given per reading response. A mean of 616.8 new words were learned and 70.9% 

>f these were retained in a long term test. The attention, attendance, cooperation, 
ind diligent work behavior of the various children were maintained in good strength 
:hroughout the duration of the study. 
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abstract 



This report describes a method of treating reading deficits based upon an 
extrinsic motivational system. The Ss were 18 junior high students including 
retarded children in special classes, several emotionally disturbed children, and 
culturally deprived children. The method of training was designed to be simple 
(enough that >ubproressional personnel could be employed to administer the treatment. 
•Ado. It volunteers and high school seniors were used as the instructional-technicians. 
ISs were given 38.2 hours of training, in daily half-hour sessions, during which 
period the average reinforcement earned was $22.29. The mean number of single 
>ord reading responses was 94,425. The rate of reading accelerated over the period 
,of training even though the reading material became more difficult. This occurred 
during a period when progressively less (about one-fourth as much) reinforcement 
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I 

INTRODUCTION 



The first author (see Staats, 1965a, 1965b.. 
in press; Staats and Staats, I963)has suggest- 
ed a general learning theory of human be- 
havior called an integrated -functional learning 
approach. An essential feature of the approach 
is that its principles and methods must be 
capable of dealing with functional human be- 
haviors and contribute to the treatment of actu- 
al problems of human behavior. One aspect 
of this has been the formulation of a theory of 
language learning and cognitive development 
(Staats, in press) — including a theoretical and 
experimental analysis of reading acquisition. 
This has involved laboratory studies of basic 
learning principles in the context of the com- 
plex behavior of reading acquisition (Staats, 
Staats, Schutz, and Wolf, 1962; Staats, Finley, 
Minke, and Wolf, 1964; and Staats, Minke, 
Finley, Wolf, and Brooks, 1964). 

In addition, on the basis of the previous 
findings and analyses, a method was devised 
for the treatment of children who because of 
learning problems have not been able to read 
normally. The method, which involved a sys- 
tem of extrinsic reinforcement employing to- 
kens, was tested in a study on a 14-year-old, 
culturally-deprived, juvenile delinquent 
(Staats and Butterfield, 1965). The study, 
involving 40 hours of training conducted over 
a 4-1/2 month period, was extraordinarily 
successful. The child, who was a severe be- 
havior problem in a traditional classroom 
situation, worked and attended well in the ex- 
perimental training program, he made over 
65,000 word reading responses, he received 
special training on 761 words he did not know, 
he retained 430 of those words (57%), and he 
passed all his courses in school (thefirst time 
hehad ever passed a course in his whole his- 
tory), his misbehaviors in schoolfell off mark- 
edly, and his general attitudes towards school 
and school work appeared to improve. In. 
addition, achievement tests showed an in- 
crease from the 2. 0 grade level in reading to 
the 4. 3 level. 



One of the major points of the treatment 
procedure, however, was the fact that it was 
designed to be easy to administer and record. 
This was done to solve the practical problems 
involved in its wide-spread use for treatment 
and for research. Thus, in the first study 
the procedure was applied by a person un- 
trained in teaching, a probation officer. This 
has many implications for education and clin- 
ical psychology. 

The procedures are very specific and 
relatively simple. Thus, it was not neces- 
sary tohave a person highly trained in ed- 
ucation to administer the training . . . . 
It might be suggested that anyone with a 
high school education and the ability to 
read could have administered the training. 
1 his has implications for the practical ap- 
plication of the presentmethods . . . [Staats 
and Butterfield, 1965, p. 939]. 

It was also suggested that the procedure 
could be applied in many settings, involving 
various types of clinical, educational, and 
social problems. For example, it was sug- 
gested that helper'- of teachers— who will be 
called instructional -technic ians herein — could 
apply the methods in special education train- 
ing programs. 

This study was based upon the results of 
a single subject, however. The next step in 
this program of research was a more general 
test of the procedure. In addition, the present 
study begins to assess the suggestion of the 
first study that learning problems can be 
treated by sub-professionals; in this case lit- 
erate high school seniors and adult volunteers 
with high school educations serve as the in- 
structional-technicians. The 18 children who 
served as experimental subjects included 
underachieving junior high school aged chil- 
dren with no special characteristics except 
their learning problem, children considered 
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to be mentally retarded and in special classes, 
and several children with emotional or be- 
havioral problems. Nine of the Ss each had 



one of 9 high school seniors as the instruc- 
tional-technician, and the other Ss each had 
one of 9 adult volunteers as the instr? tional- 
technician. 



< 
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II 

METHOD 



SUBJECTS 

Eighteen experimental and 18 control Ss 
were selected for participation in the project 
in the following manner: Lists of students 

defined as poor readers on the basis of stand- 
ard achievement tests and enrolled in 7th or 
8th grade or in Special Classes were obtained 
from five junior high schools in Madison, 
Wisconsin. These lists were then circulated 
to the teacher sof the various schools with the 
instructions to delete any names they felt were 
not reading problems and to add any students 
to the list whom they felt had been omitted. 
Final selection of Ss for participation in the 
study was done on the basis of a 100-item 
word recognition test developedfrom the read- 
ing material. The 36 Ss selected (28 male 
and 8 female) w<:re those most deficient on 
this oral test, reading 81 or fewer of the words 
contained. Subjects were grouped in thirds, 
from low to high, on the basis of their scores 
on the oral reading test, and subsequently 
divided into 18 pairs, matched on the basis 
of their score on this word recognition test 
and, where possible, on the basis of IQ scores. 
The 18 Ss for the experimental group were 
obtained by randomly selecting one of the two 
students from each matched pair. 

The averageage at the start of the experi- 
ment was 14 years, 6 months; the average IQ 
was 74 (69 Verbal and 80 Non-verbal). Seven 
of the experimental Ss and seven control Ss 
were enrolled in classes for the educable 
mentally retarded. A number of Ss were 
classified as educationally disadvantaged un- 
der provisions of the new Federal Education 
Act. 

EXPERIMENTERS 

Two types of experimenters were utilized 
in this study. Nine adult volunteers were ob- 
tained through the P. T. A.'s of the participating 
schools and through the efforts of the indi- 



vidual principals. These volunteers were paid 
$2.00 a session to cover transportation, baby- 
sitting fees, etc. The adult volunteers were 
all housewives with an average of 4 children 
living at home. Their average age was 42, 
and their families earned approximately $7,500 
per year. 

In addition, nine high school seniors were 
obtained from East Senior High School and 
Central Senior High School. These students 
were selected by first circulating a list of the 
senior class among the teachers at the high 
school and having the teachers give the stu- 
dents a combined rating o f above average, 
average, or below average on the character- 
istic sof maturity, responsibility, and accept- 
able reading level. Students receiving a be- 
low average rating from any teacher were 
eliminated, and the remainder served as the 
pool from which volunteers were sought. 
These Es were paid $1. 25 per 1/2-hour ses- 
sion, and participated i n the study 5 times 
per week during their regularly scheduled 
study halls. The seniors had a mean grade 
average of B for their 4 years inhigh school. 

The instructional* technicians were trained 
intheuseof the procedures in two 1-1/2 hour 
sessions just prior to the beginning of the 
project. This training included some experi- 
ence in actually administering the materials. 
In addition, a brief third training session was 
held at the end of the second week to generally 
instruct the instructional-technicians on an 
error in procedure committed by one of the 
adult volunteers and to indicate some minor 
changes in procedure. 

INSTRUMENTS 

The primary test used, both for grouping 
purposes and as a dependent variable, con- 
sisted of 100 words randomly selected from 
the reading materials used in the experimental 
sessions consisting of 20 words selected from 
each of five grade levels (1. 2, 1.7, 2. 3, 3.0, 



3 



and 4.0). Subjects had to correctly pronounce 
the word shown them on a flash card in order 
to receive credit. The pretest was given be- 
fore assignment to treatment condition, and 
the posttest was administered by persons un- 
familiar with the experiment. 

Other tests, primarily of the standardized 
variety, were given during the screening of 
subjects, specifically the Iowa Test of Basic 
Skills , Grade 7, Form 2, and the Lorge- 
Thorndike Intelligence Test , Level D, Form 
1. It was planned to give alternate forms of 
these tests as posttests so the Ss' pre- and 
post -experimental performance could be com- 
pared. Many subjects, however, as will be 
discussed further on, appeared to be answer- 
ing some of their tests randomly, and their 
chance scores substantiated this suspicion. 

The same result was shown on the posttest 
where scores on the reading and vocabulary 
subtests of the Iowa Test of Basic Skills, 
Grade 7, Form 3, made it apparent thatmany 
of the subjects were still operating at chance 
expectancy. Accordingly, four additional 
sessions were utilized in which easier tests 
(the Lorge- Thorndike Intelligence Test , Level 
C, Form 2; and the California Reading 
Achievement Test. Elementary, Grades 4-6, 
Form W; and Forms 1 and X of the same two 
tests, respectively) were administered. As 
part of the attempt to salvage the test data, 
the extrinsic reinforcement method was ex- 
tended to the testing situation during the last 
two sessions when the alternate forms were 
given. The alternate forms were thus given 
with the addition of the following reinforce- 
ment contingency instruction: 

The test you are to take now involves apro- 
cedure you should understand thoroughly. 
You will be able to earn money for doing 
this test— that is, by working hard, reading 
each item, and not guessing on items you 
do not know. You will get two cents for 
every question that you answer correctly— 
so the more questions you answer correctly 
the more you will make. However, you 
will lose one penny for every incorrect 
answer. Thus, if you do not read questions, 
and just guess at the answers, you will end 
up making nothing. Remember you get two 
cents for each question you answer cor- 
rectly, and lose one cent for each incor- 
rect answer. 

Tests were scored, and students paid by en- 
velopes a few minutes after these two con- 
tingency test sessions ended. 



REINFORCER SYSTEM 

There were three types of token, distin- 
guished by color. The tokens were of diffe rent 
value in terms of the items for which the to- 
kens could be exchanged. Initially, a blue 
token was valued at 1/5 of one cent, a yellow 
token at 1/2 of a cent, and a red token at 1 
cent; however, at the end of the first week the 
tokens were devalued, and for the remainder 
of the study the value of the blue, yellow, and 
red tokens was 1/10, 1/5, and 1/2 of one cent 
respectively as in the original study. 

During the eleventh week of the study a 
bonus system was introduced for a few slow 
children in order to maintain a $• 20 minimurn 
in terms of s daily earnings. From this 
point on, whenever S's earnings fell below 
$. 20 for any one session, hewas given a bonus 
of $.05, $.10, or $.15, whichever amount 
was needed in order to bring his earnings for 
that day up to between $. 20 and $. 25. When 
the bonus system was introduced, the child 
was told that he could not move as rapidly as 
before because the material upon which he was 
working was difficult, but if he continued to 
workhard he would be given a bonus at the end 
of the session. 

The child' s acquisition of tokens was plotted 
so that visual evidence of the reinforcers was 
available. The tokens could be used to pur- 
chase a variety of items. These items, chosen 
by the subject, could range in value from pen- 
nies to whatever the subject wished to work 
for. Records were kept of the tokens earned 
by and of the manner in which the tokens 
were used. 



READING MATERIALS 

The reading material, as in the earlier 
study (Staats and Butterfield, 1965), was taken 
from the Science Research Associates (SRA) 
reading -kit materials, Reading Laboratories 
IA, IB, IC, and II A. The SRA kits consist of 
stories developed for and grouped into grade 
levels. For the purposes of this study, stories 
were taken from the 1.2 (20 stories), 1.4 (60 
stories), 1.7 (20 stories), 2. 0 (32 stories), 
2.3 (12 stories), 2.6 (12 stories), 3.0 (32 
stories), 3. 5 (32 stories), and 4. 0(10 stories) 
grade levels. Once a particular Reading Lab- 
oratory was selected for inclusion at a given 
grade level, all the stories at that grade level 
were presented in sequential order (with the 
exception of the 4. 0 grade level, where only 
the first 10 stories in Laboratc *y IIA were 
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presented). The different numbers of lessons 
presented at each grade level were due to the 
use of different numbers of Reading Labora- 
tories, in order to control somewhat the rate 
of introduction of new words. 

Each story includes a series of questions 
which can be used to assess the reader's 
comprehension of the story. The reading 
training program which again was that of the 
earlier study may be summarized as follows. 

Vocabulary Words 

A running list was made of the new words 
that appeared in the series of stories. Each 
different form of a word was counted as a dif- 
ferent word for this purpose; thus, bring , 
brings , and bringing were all counted as dif- 
ferent words. The list finally included each 
different word that appeared in the stories, a 
total of 4,253 words. From this list, the new 
vocabulary for each story was selected, and 
each word was typed on a separate 3x5 rard. 

The average number of new words intro- 
duced in each story was 18. 5, the least num- 
ber of new words being introduced in any one 
story being 6, and the most being 50. The 
few stories presenting over 30 new words 
occurred in the second half of the program, 
after the children had already been presented 
a large number of lessons. 

Oral Reading Materials 

Each paragraph in the SR A stories was 
typed on a 5 x 8 card. Each story could thus 
be presented to S paragraph by paragraph. 

Silent-Reading and Comprehensive-Question Materials 

Each SRA story and its comprehensive 
questions were typed on separate 8-1/2 x 11 
sheets of white paper. 

PROCEDURE 

Vocabulary Presentation 

Theprocedure for each story in the series 
commenced with the presentation of the new 
words introduced in that story. The words 
were presented individually on the cards, and 
S was asked to pronounce them. A correct 
response to a word-stimulus card was even- 
tually reinforced with a mid^ value token. After 
a correct response to a word, the card was 
dropped from the group of cards yet to be 



presented. The S was instructed to indicate 
words that he did not know the meaning of, and 
this information was provided in such cases. 

When an incorrect response to a word 
stimulus occurred, or when S gave no re- 
sponse, the instructional-technician gave the 
correct response. The S. then repeated the 
word while looking at the stimulusword. How- 
ever, the word card involved was returned to 
the group of cards still to be presented. A 
card was not dropped from the group until it 
was read correctly without prompting. After 
an error on a word stimulus, only a low-value 
token was given for finally reading the word 
cor rectly without prompting. The vocabulary- 
presentation phase of the training was con- 
tinued until each word was read correctly 
without prompting. 

Initially, the appropriate tokens were de- 
livered immediately contingent upon each un- 
prompted correct reading trial. However, by 
the end of the second week it was apparent 
that the instructional-technicians were having 
difficulty presenting materials, watching S, 
keeping data, and delivering tokens all at the 
same time. So at the beginning of the third 
week, all tokens were delivered at the end of 
each phase rather than after each reading 
response. 

Oral Reading 

Upon completion of the vocabulary mate- 
rials, each paragraph was individually pre- 
sented toS in the order in which the paragraph 
occurred in the story. When correct reading 
responses were made to each word in the 
paragraph, a high-value token was given upon 
completion of the paragraph. When a para- 
graph contained errors, S was corrected, and 
he repeated the word correctly while looking 
at the word. The paragraph was put aside, 
and when the other paragraphs had been com- 
pleted, the paragraph containing errors was 
again presented. The paragraph was repeated 
until it was read correctly in its entirety — 
at which time a mid-value token was pre- 
sented. When all paragraphs in a story had 
been completed correctly, the next phase of 
the training was begun. 

When a few of the Sshad some difficulty in 
reading the words in sequential order, either 
failing to respond to one or more words in the 
sequence, or adding words for which no tex- 
tual stimuli were presented, the instructional- 
technician would point to each word in turn and 
S would then read them. It was generally pos- 
sible to later drop this additional procedure 
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without any loss in the control over Ss' 
performance. 

Silent Reading and Comprehensive Questions 

Following the oral reading S was given the 
sheet containing the story. He was instructed 
to read the story silently, and he was told that 
it was important to read to understand the 
story so that he could answer the questions 
which would be presented later. 

Initially, S was presented reinforcement 
in the form of a yellow token on a 15-second 
variable interval schedule. That is, as long 
as S appropriately scanned the material he 
was given a mid -value token an average of 
every 15 seconds. However, th^rewasa great 
deal of variability in the abilities of the in- 
structional-technicians to administer this type 
of schedule, and so at the beginning of the 
third week the procedure was changed such that 
four yellow tokens were delivered at the end 
of the silent reading of the story, regardless 
of the time needed to complete the task. 

Upon completion of the story, S wrote his 
answers to the questions typed on a separate 
sheet and gave his answers to the instructional- 
technician. For each correct answer, S re- 
ceived a high-value token. For an answer 
with a spelling error, S was reinforced with 
a mid-value token when he had corrected the 
answer. For incorrect answers S had to re- 



read the appropriate paragraph, correct his 
answer, and he then received a low-value 
token. 

Vocabulary Review 

Some of the vocabulary words presented to 
in the first phase of training were words he 
already could read. Many others, however, 
were words that the procedure was set up to 
teach. The oral -reading -phase performance 
indicated the level o£S' s retention of the words 
he had learned — and also provided further 
training trials on the words not already learn- 
ed. A further assessment of S's retention of 
the words that he did not know in the vocabu- 
lary training was made after each 20 stories 
of the SRA materials had been read. This 
test of individually presented words, for each 
story, was started immediately after com- 
pletion of the 20 stories and constituted fairly 
long-term retention. 

This test was also used as a review for S, 
and further training on the words was given. 
When S couldnot read a word, or missed one, 
he was prompted and had to correctly repeat 
the name of the word while looking at the word. 
This word card was then put aside and pre- 
sented later, at which time S was reinforced 
with a low-value token if he read it correct- 
ly. If not, the procedure was repeated until 
a correct unprompted trial occurred. 
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Each of the 18 subjects participated in the 
study over the period of the study with excel- 
lent cooperation, attention, and work behav- 
iors. In the original study, the juvenile de- 
linquent subject made about 65,000 single 
word reading responses in 40 hours of train- 
ing. The mean number of hours of training 
in the present study was 38.2. During this 
period the mean number of word reading re- 
sponses made by the subjects was 94, 425. 

The present sample of subjects included 
children with varying degrees of initial read- 
ing skill, as indicated by their ability to read 
the 100 individual words taken from the SRA 
materials in the way that has been described. 
Thus, the percentage of these words that could 
be read by the children prior to the training 
varied from a low of 22 percent to a high of 
81 percent. In order to better see how the 
training affected the subjects as a function of 
their initial reading ability some of the re- 
sults to be presented will be grouped accord- 
ing to thelevelof the children’s ability. Thus, 
the 18 Ss were grouped into 3 subgroups. The 
lowest had a mean percentage of reading the 
SRA words of 43. 2 prior to the study. The 
mean IQ of these children on the Lorge- 
Thorndike Intelligence Test was 71.9, rang- 
ing from 50 to 91. The middle group had a 
mean percentage of 69. 2 on the SRA words, 
and a mean IQ of 74. 2, with a range from 53 
to 94. The high group had a mean percentage 
of 79.0 on the SRA words, and a mean IQ of 
79. 9, with a range of 66 to 92. 

The total number of words the child read 
during the training was tabulated. This in- 
cluded the words he read when singly pre- 
sented on cards, the words he read aloud in 
the paragraph reading phase, the words he 
read in the silent reading phase, the words 
read in the comprehensive questions, as well 
as the words read in the vocabulary review. 
The cumulative record of the words read over 
the period of training for the three groups of 
*5s is given in Figure 1. In the original study 
of Staats and Sutterfield the record produced 



indicated that the rate of reading accelerated 
slightly during the period of training— even 
though the reading materials became steadily 
more difficult. A similar result is shown for 
the Ss in the present study. (The later points 
on the curves, where indicated, do not include 
all the Ss in the group, since some Ss did not 
participate as many hours as others "did. ) The 
6 Ss with the highest reading ability show the 
greatest acceleration, the middle group the 
next highest, and the low group the least. 
This ac celeration takes place even though the 
children advance from 1. 2 grade reading ma- 
terials to 3.5 grade level materials. It does 
appear, however, that the increase in diffi- 
culty does dampen the acceleration in reading 
rate of all the Ss particularly those with low 
reading ability. In any event, as in the orig- 
inal study, the children's behavior of attend- 
ing to the task and making the appropriate 
reading responses did not diminish throughout 
the period of training. Thus, the token rein- 
forcement system employed was capable of 
maintaining the work behaviors of these chil- 
dren for a long period of time. During this 
time the attentional and cooperative behaviors 
instigated resulted in many, many, learning 
trials— a necessary aspect for the acquisition 
of achievement in any skill. 

Records were kept of the number of words 
the children missed on first presentation, the 
number of these words which were then later 
missed in the oral reading of the paragraphs, 
as well as the number of the words originally 
missed that the child could not read on the 
review test presented at the later time. On the 
original study the juvenile delinquent missed 
761 words on first presentation, 585 (or 
about 77%) of these were retained in the oral 
reading phase, and 430 (about 57%) were re- 
tained in the review test when presented singly. 
The corresponding mean words missed on first 
presentation for the low ability group in the 
present study was 820. 3. In the oral reading, 
686. 5 (or 83. 7%) were retained. In the long- 
term retention test, where the words were 
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Figure 1. Cumulative number of reading responses as a function of the time in reading train- 
ing for low, middle, and high initial reading ability groups. The letter (a) on each 
curve represents the last point in which all Ss in that group are represented. 



presented singly, 433. 3 (57.4%) were re- 
tained. (Since the vocabulary review occurred 
at the end of each 20 lessons, most Ss were 
presented with a number of words in the last 
few sessions which were not tested in this 
manner.) The performance of this group of 
Ss was thus roughly comparable to that of the 
hr st subject. The middle ability group missed 
535 words on first presentation, retained 
483.8 (or 90.4%) on second presentation, and 
retained 306. 2 (or 69. 1%) on the long-term 
review test. The means for the high ability 
group were 425. 2 firstmissed, 360.3 (or 84%) 
retained on second test, and 291. 0 (or 78. 7%) 
on review test. 

Evidence indicated that the 100-word test 
taken from the SRA reading materials was a 
valid indicator of the Ss' reading skill with 
the SRA materials. That is, the low, middle, 
and high groups on this measure(with respec- 
tive reading percentages of 43.2, 69.2, and 
79. 0) performed in that order of ability on the 
reading task itself. On the measure of the 



percentageof the words in the various stories 
that could be read on first presentation the 
means of the three groups were 67. 7%, 82. 1%, 
and 87. 8%. In addition, similar results were 
shown on the measure of long term retention, 
the vocabulary review. The percentages re- 
tained correctly on this measure for the three 
groups were 57. 4%, 69. 1%, and 78. 7%. 

In the original study there seemed to be 
someevidence that as the training progressed 
the subject missed fewer of the words on first 
presentation. This type of evidence would 
tend to indicate that the subject had been learn- 
ing to sound out new words as a function of 
the training. Additional evidence (Staats, 
1965b) has shown that subjects can learn syl- 
labic reading units from training on whole 
word reading tasks. To test the possibility 
that improvement in sounding out new words 
occurred in the present study, the ratio of 
words missed the first time to the total pre- 
sented was computed for each 20 stories pre- 
sented. The results for the 3 groups are 
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shown in Figure 2. The later points on each 
curve do not include all the subjects in the 
group, since some Ss did not read as many 
stories as others did. In any event, there 
does not appear to be evidence that the ratio 
decreased, but rather that it was constant 
throughout the training except for an increase 
in the ratio from the 1 . 2 grade level to the 
i . 4 grade level of the reading material. The 
question of whether phonetic reading skills 
emerge from this type of trainingmay be con- 
sidered to remain unanswered. More direct 
test will probably be necessary. 

One of the important aspects of the pro- 
cedures involves the ratio of reinforcement 
for the reading responses. The procedures 
were designed t o progressively reduce the 
amount of reinforcement given per reading re- 
sponse as the training progressed— or con- 
versely to require more reading responses 
per unit of reinforcement. As was indicated 
in the original study, demonstration that this 
is possible on a long-term training program 
has a number of important implications. Such 
a demonstration "is in part an answer to the 
question whether the use of extrinsic rein- 
forcers in training will produce a child who is 



dependent upon these reinforcers" (Staats and 
Butterfield, 1965, p. 941). 

Figure 3 supports the original demonstra- 
tion by showing that the ratio of the amount of 
reinforcement earned divided by the number 
of words read decreases as a function of num- 
ber of training sessions. Bythetime60 train- 
ing sessions have been completed the children 
were receiving about one-fourth as much re- 
inforcement for their reading behavior as they 
had at the beginning of the training. This was 
trueof the 3 groups. This result is especially 
interesting in view of the acceleration of rate 
of reading response shown in Figure 1. That 
is, in the period of time during which the re- 
inforcement per reading response is cut in 
quarter, the rate of reading increases. Aside 
from its theoretical and practical implications 
for child learning, it is interesting to note that 
the result would be expected directly from 
basic laboratory research with lower organ- 
isms, on the effects of reinforcement sched- 
ules. Thus, it is important to see that the 
basic principles hold even in the context of 
such a complex and uniquely human behavior 
as reading. 
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Figure 2. The ratio of words missed upon initial presentation to the total number of words 
presented in the Individual Word Phase as a function of the number of^SRA stories 
read for die three ability groups. The letter (a) on each curve represents the last 
point in which all Ss in that group are represented. 
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Figure 3. Ratio of the monetary value of the tokens received divided by the number of reading 
responses made as a function of time in reading. The letter (a) on each ability 
group curve represents the last point in which all Ss are represented, and the letter 
(b) indicates the time block in which the bonus was introduced for most Ss. 



As has been described, the test data con- 
cerning reading achievement appeared to be 
invalidated by the test-taking behavior of the 
children. In addition to that, midway through 
the experiment special reading programs were 
begun under the Elementary and Secondary 
Education Act Title I so that the control group 
could no longer be considered as such. The 
experiment, however, had been designed so 
that analyses of covariance could be made of 
the data. The main effects were the experi- 
mental treatment (experimental versus con- 
trol) and the division into high, middle, and 
low groups on the basis of the first oral word 
reading (100 SRA words). The pretests were 
used as the covariates for the various posttest 
dependent variables. Although a detailed pre- 
sentationof the design and analysis is not pro- 
ductive here, there are a few points that are 
relevant, as well as a few analyses of addi- 
tional measures. 
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First, the analysis indicated that the test 
to be the main measure of reading achieve- 
ment, the Iowa Test of Basic Skills , was in- 
deed not producing reliable results. There 
was not even a significant relationship between 
the Ss' pre- and posttest results, indicating 
that the Ss were randomly scoring their test 
items. It is also interesting to note that the 
I. T.B.S. was not related to the oral reading 
of the SRA words— a measure with consider- 
able face validity as well as empirical valid- 
ity in the study. There was a significant re- 
lationship between the pre- and posttest per- 
formance on the Lorge-Thorndike . However, 
there was no significant effect of the experi- 
mental treatment as compared to the control 
group results. The California Achievement 
Test was given twice (alternate forms) in the 
posttest sessions in an effort to salvage the 
test results. There was not a significant re- 
lationship to the pretest I. T.B.S. however, 




when the test was taken without reinforcers 
and only a significant relationship (at the . 01 
level) of the vocabulary part with the pretest 
I. T.B.S. when the test was taken with rein- 
forcers. However, the analysis of covariance 
didnot reveal any effect on the C.A. T. of the 
experimental treatment, when the pretest I. 
T.B.S. was used as the covariate. The C.A. 
T. vocabulary part taken with no reinforce- 
ment was significantly related (. 01 level) to 
the oral reading measures (SRA words), how- 
ever, th e reading comprehension part was 
not. Under the reinforcement procedures both 
parts of the C. A. T. were significantly related 
to the pretest oral reading measure (the vo- 
cabulary at the . 001 level and the reading part 
at the . 05 level). 

The individually administered oral reading 
measure composed of the 100 SRA words ap- 
peared to be a reliable measure to the extent 
that thepre- and posttest measures were sig- 
nificantly related (at the . 001 level). Further- 
more, this measure did show a significant ef- 
fect(at the .01 level) of the experimental treat- 
ment. The experimental group subjects im- 
proved more on their ability to read the 100 
words than did the control group. The ex- 
perimental group had an original mean score 
of 63. 8 on this measure and a posttest score 
of 76. ^corresponding scores for the control 
group were 63. 6 and 69. 2. 

Fourteen of the 18 control Ss improved up- 
on their oral reading— probably, at least in 
part, as a result of the special reading pro- 
grams to which they also had been exposed, 
some of which used SRA materials,. It is in- 
teresting to note that although each of the ex- 
perimental S^s improved in this measure of 
reading ability, some by large amounts, the 
greatest improvement was shown by the Ss 
who were originally lowest in the oral reading 
ability. The mean improvement for the low, 
medium, and high ability subgroups in the 
experimental group was respectively 18.3, 
8. 8, and 9. 8. The mean improvement for all 
the groups combined was 12. 2 words. 

Another analysis of variance was condi cted 
to analyze the effect of introducing the rein- 
forcement contingencies upon the test-taking 



behaviors, and the test results, of the Ss for 
the California Achievement Test (both vocab- 
ulary and reading sections), and the Lorge- 
Thorndike on the verbal, non-verbal, and total 
scores. The main effects were the Experi- 
mental Treatment, Previous Oral Reading 
Ability (that is, the 3 groups, low, middle, 
and high) and the Reinforcement Contingency 
Condition. The results showed that the rein- 
forcement procedure for the test administra- 
tion produced significant effects upon the 
California Achievement Test. On the vocab- 
ulary the Ss advanced from a mean of 23. 56 
under no reinforcement to a mean of 26. 00 
with the reinforcement contingencies. This 
result was significant at the . 05 level as was 
the next re suit. On the reading comprehension 
part the Ss advanced from a mean of 35. 53 to 
38. 50. The results for the Lorge -Thorndike 
were: on the verbal an advance from a mean 
of 67.83 to 71. 36, on the non-verbal an advance 
from a mean of 80. 89 to 85.89, on the total 
IQ an advance from 74. 36 to 78. 62. These 3 
results were significant at the . 001 level. 

This improvement occurred even though 7 
fewer items, on the average for the two tests, 
were answered by theSs under the reinforce- 
ment contingency conditions. Thus, the chil- 
dren's scores under the no-reinforcement 
conditions were spuriously inflated since some 
of those 7 additional items would have been 
correct by chance. At any rate, it may be 
concluded that the children's test behaviors 
and test results were affected, and in a posi- 
tive direction, when they were reinforced for 
correct test-taking behavior and punished for 
random responding. 

Two types of instructional-t echnicians 
were employed in the study and it is of some 
interest to compare the results of their sub- 
jects for possible differences. As measured 
by the pre-and posttest on the 100-word SRA 
oral reading measure, the mean gain in words 
read was 12. 56 for the adult-volunteer in- 
structional-technicians and 12. 1 1 for the high 
school -student ins t ructions 1 -technicians. 
Thus, neither group of instructional- techni- 
cians was differentially effective. 



IV 

DISCUSSION 



The general procedure and the reinforcer 
system appear to be generally functional. The 
attention, attendance, cooperation, and dili- 
gent work behavior of the various children 
was maintained in good strength throughout 
the length of the study. In fact these be- 
haviors appeared to be gaining strength. Thus, 
the methods have proved successful with an 
incorrigible juvenile delinquent and with the 
present group of 18 children which included 
mentally retarded children and emotionally 
disturbed children. In addition, the proce- 
dures have also been tested with the same 
success in clinical work with several psychi- 
atrically referred children, one of whom was 
diagnosed as schizoid, as well as with ongoing 
research which will be mentioned further on 
with culturally -deprived problem learners, 
emotionally disturbed children, and younger 
children with learning problems. 

Ithas been suggested (Staats, 1965a, 1965b, 
in press, in preparation; Staats and Staats, 
1963) that learning principles and procedures 
must be extended— in the realms of clinical, 
educational, and child psychology— to the study 
and treatment of complex human behaviors in 
long-term studies . The present methods in- 
dicated that this maybe done with detailed and 
objective observations of the process— while 
the subjects are being treated for behavior 
problems. In this case the problem was a case 
of cognitive deficit. 

It should be stressed that this was possible 
employing sub-professionals as the instruc- 
tional- (or therapy-) technicians. In the first 
study a probation officer served in this role. 
Inthepresent case the technicians werehouse- 
wives and high school students. As was out- 
lined in the first Uiie of these procedures 
(Staats and Butterfield, 1965), this finding has 
general implications for various areas of 
clinical psychology and education in which in- 
dividual treatment of behavior problems is too 
costly when professionals are employed, but 
where the child does not adjust to the group 
training situations. The following conclusions 



made in the first study are thus supported by 
the present results. 

Thus, the procedures could be widely ap- 
plied or adapted by various professionals, 
for example, social workers, prison of- 
ficials, remedial teachers, (special edu- 
cation teachers], tutors, and so on. In an 
even more economical application, helpers 
of professionals could be used to actually 
administer the procedures; for example, 
selected delinquents (or prisoners) could 
administer the procedures to other delin- 
quents. Thus, the procedures could be 
utilized in various situations, such as set- 
tlement houses, homes for juvenile delin- 
quents, prison training programs, parts of 
adult education, and so on. . . . [T]he 
results suggest possibilities for economic 
innovations in education generally [Staats 
and Butterfield, 19651. 

Related to this is the matter of the cost of 
the treatment. In the first study the 4-1/2 
months of trainingcost $20. 31 in reinforcers. 
This was clos ely paralleled in the present 
study. That is, the mean cost for the subjects 
over a similar period was $22. 29. In many 
cases this would be a very minimal price to 
pay to insure that ordinarily untrainable chil- 
dren would be not only amenable to various 
types of instruction, but would work diligently 
and learn well. 

Moreover, not only was the behavior of 
these childrenvery appropriate in the learning 
situation, but the behavioral measures indi- 
cated that the children covered a great deal of 
material and learned a number of new words. 
A mean of 616. 8 words were presented that 
the children did not at first know. After train- 
ing, in the long-term retention measure, the 
children retained a mean of 70. 9% of the words . 
This represents a straightforward indication 
that the children (including 7 children in class- 
es for the retarded) were learning the material 
on which they were being trained. In addition, 
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the curves that indicate the rate at which the 
children were reading was accelerating, or 
remaining constant, even though the difficulty 
of the material was increasing. 

These findings were supplemented by the 
differences between the experimental and con- 
trol subjects on their ability to read the 100 
SRA words when individually presented to each 
child. The experimental group increased in 
this index of reading ability to an extent that 
was significantly greater than the increase of 
the control group. Thus, when the measure- 
ment of reading ability was controlled and 
directly observed, and when the reading stim- 
uli were among thoseon which the experimen- 
tal group had received training, the effect of 
the experimental treatment was seen over the 
whoie group as compared to the control group. 
This occurred even though the control group 
had begun to receive special training in read- 
ing during the study. 

However, the standard test results did not 
show a difference between the pre- and post- 
test measures, for the experimental over the 
control group. Several alternative interpre- 
tations of this lack of effect on the tests are 
available. The training materials may not 
produce general skills that transfer to other 
tasks, such as taking an achievement test. If 
that was the case it would be an indictment 
against the SRA materials at least when pre- 
sented in the present fashion. This interpre- 
tation is difficult to accept, however. Pro- 
vided the vocabulary included in the materials 
has generality, learning the words should by 
itself improve the child's reading ability for 
any verbal material that included the words. 
Nevertheless, it is possible that more train- 
ing would be necessary to be reflected on the 
tests, that the reading materials used might 
have been too elementary for many of these 
subjects, and so on. 

Another possibility is that the tests them- 
selves were insensitive dependent variables 
in the present instance. For example, the 
tests might have been too difficult. It is in- 
teresting to note that the California Achieve- 
ment T e 8 1 , but not the Iowa Test of Basic 
Skills, was significantly related to the oral 
reading scores. Perhaps if the more ele- 
mentary C.A.T. had been given in the pre- 
test instead of the I. T. B. S. , the experimen- 
tal effect might have been shown. It is even 
more likely, however, that the test -taking be- 
havior of these children was at fault. The 
children all had long histories of unsuccess- 
ful (which means unreinforced) test taking. 
It would be e xp ec te d under those circum- 



stances that the behavior would deteriorate 
and be replaced by less effortful behaviors— 
that is indiscriminant marking of items. This 
weakness would have been amplified by the 
use of achievement tests that were too diffi- 
cult for the children, since it would prevent 
the children from obtaining reinforcement in 
taking the test. It should be noted that in the 
Staats and Butterfield study, where the tests 
did reflect the success of the training, the 
testing was individually administered rather 
than a group procedure, which guaranteed 
better test-taking behavior to some extent, 
and the tests were easier and contained fewer 
items (both of which would enhance the rein- 
forcement properties of the testing situation). 

It should also be noted that the special train- 
ing given to the control group children prob- 
ably helped obscure the positive affects of the 
experimental treatment. This, of course, 
would also have had an affect upon the results 
of the individually administered word reading 
measure. 

It should be added, however, that these are 
matters for further research, some of which 
is now underway. The first two authors have 
a study now in progress in which the tests and 
testing circumstances have been improved. 
The study involves culturally-deprived chil- 
dren who are more homogeneously low in their 
reading deficits. The instructional-technicians 
will be drawn from a culturally-disadvantaged 
population. Adrian Van Mondfrans and the 
first author are also extending the methods 
and materials to younger children (4th, 5th, 
and 6th graders), with and without extrinsic 
reinforcement. And Judy McBurney and the 
first author have a study in which emotionally 
disturbed children are being individually test- 
ed in greater detail during the course of the 
treatment. Although these studies are not yet 
completed, it is again evident that the proce- 
dures and reinforcer system work universally 
well with various children with various types 
of problems. 

The reinforcement system used in the Staats 
and Butterfield study as well as the present 
study was designed so that the amount of rein- 
forcement given per unit of behavior decreases 
over the period of the training. Thus, during 
the training the child was gradually getting 
less reinforcement for his reading behavior 
even though his behavior was maintained in as 
good or better strength. This result is in part 
an answer to the frequent question concerning 
the u s e of extrinsic reinforcement in child 
training— that is, that the child may become 
dependent upon the reinforcers. It may be 
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suggested that sophisticated methods may be 
designed to produce the desirable behavior but 
with no lasting dependence upon a-i artificial 
system of reinforcement. 

One more point will be mentioned here. 
The reinforcement procedure which was in- 
troduc ed to produc e better test-taking behavior 
has general significance. The children who 
had been randomly scoring items began to 
peruse each item more carefully under the 
extrinsic reinforcement procedure. The re- 
sults indicated that when the good test-taking 
behavior was reinforced and poor test-taking 
behavior punished the desired behavior in- 
creased in strength to a statistically signifi- 
cant extent. When reinforced the children 
scored better on the tests, even while answer- 
ing few * test items. This has theoretical 
implications; for differences in "motivation" 
are not usually controlled when testing dif- 
ferent groups of subjects, even though there 



are many categories of individuals whose mo- 
tivational circumstances would no doubt affect 
their test measurements. It may be suggested 
that any group comparisons on tests should 
take into account likely differences in test- 
taking behavior as a function of motivational 
differences. It is suggested that this could be 
controlled in theoretical studies by insuring 
the subjects of the experiment are reinforced 
for good test-taking behavior. 

The present findings also have clinical and 
educational significance, however. There are 
many cases in which test results are part of 
diagnosis and treatment and where the test- 
taking behavior?! of the subject prevent accu- 
rate appraisal. It is suggested that more valid 
test results could be obtained through the use 
of reinforcement to strengthen the "good test 
behavior. " The first author plans additional 
research in this area of study. 
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